Nof1, an artificial intelligence research lab focused on financial markets, began a large-model trading test, AlphaArena, on the 18th. The test uses six mainstream large AI models (GPT-5, Gemini 2.5 Pro, Grok-4, Claude Sonnet 4.5, DeepSeek V3.1, Qwen3 Max), each of which receives $10,000 in real money on Hyperliquid, with the same prompts and input data. As of press time, DeepSeek and Qwen3 Max have doubled their money and hold a commanding lead.
On September 24th, according to the official account of Tongyi Qianwen (Qwen): Following the release of the Qwen3-2507 series, we are very pleased to introduce Qwen3-Max, our largest and most capable model to date. Currently, the preview version of Qwen3-Max-Instruct ranks third on the LMArena text leaderboard, surpassing GPT-5-Chat. The official version further improves code and agent capabilities, achieving industry-leading levels in comprehensive benchmarks covering knowledge, reasoning, programming, instruction following, human preference alignment, agent tasks, and multilingual understanding.
Alibaba (09988.HK) rose as much as 4.6%, hitting its highest level in three years and 11 months, after the company released Qwen3-Max, its largest and most capable model to date.
Tongyi Qianwen has launched Qwen3-VL, the most powerful vision-language model in the Qwen series to date. The flagship model, Qwen3-VL-235B-A22B, is now open source and available in Instruct and Thinking editions, which outperform Gemini 2.5 Pro on key visual tasks.
Tongyi Qianwen, a subsidiary of Alibaba, has released the next-generation foundation model architecture Qwen3-Next and open-sourced the Qwen3-Next-80B-A3B series of models based on this architecture. Compared with the MoE model structure of Qwen3, the new structure has the following core improvements: a hybrid attention mechanism, a high-sparsity MoE structure, a series of optimizations for training stability, and a multi-token prediction mechanism to improve inference efficiency. Based on the Qwen3-Next model structure, Alibaba trained the Qwen3-Next-80B-A3B-Base model, which has 80 billion parameters but activates only 3 billion...